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Abstract. We prove a general theorem providing smoothed analysis estimates 
for conic condition numbers of problems of numerical analysis. Our probabil- 
ity estimates depend only on geometric invariants of the corresponding sets of 
ill-posed inputs. Several applications to linear and polynomial equation solv- 
ing show that the estimates obtained in this way are easy to derive and quite 
accurate. The main theorem is based on a volume estimate of e-tubular neigh- 
borhoods around a real algebraic subvariety of a sphere, intersected with a disk 
of radius a. Besides e and cr, this bound depends only the dimension of the 
sphere and on the degree of the defining equations. 

1 Introduction 

In a seminal article [12] J. Demmel suggested that "to investigate the probability 
that a numerical analysis problem is difficult, we need to do three things: 

(1) Choose a measure of difficulty, 

(2) Choose a probability distribution on the set of problems, 

(3) Compute the distribution of the measure of difficulty induced by the distribu- 

tion on the set of problems." 

Then, for the measure of difficulty, Demmel proposed the condition number. This is 
a positive number which, roughly speaking, measures the sensitivity of the output 
to small perturbations of the input. It depends only on the input data and the 
function being computed. Condition numbers occur in endless instances of round- 
off analysis. They also appear as a parameter in complexity bounds for a variety of 
iterative algorithms. 
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The main results in [12] carry out an analysis as sketched in (l)-(3) above for 
the condition number ^ of several problems. This analysis exhibits bounds on the 
tail of the distribution of 'if (a), showing that it is unlikely that 'if (a) will be large. 
^From these bounds one can obtain, using standard methods in probability theory, 
bounds on the expected value of ln(^(a)), estimating the average loss of precision 
and average running time for algorithms solving the considered problem. Demmel's 
results thus yield prime instances of average-case analysis of algorithms in numerical 
analysis. 

While average-case analysis has undoubtedly advantages over worst-case analy- 
sis, it is not itself without shortcomings, the most noticeable being the arbitrariness 
of the selected probability distribution on the set of inputs. To find a way out of 
these shortcomings, D. Spielman and S.-H. Teng [30, §3] proposed a new form of 
analysis that arguably blends the best of both worst-case and average-case. The 
idea is to replace showing that 

"it is unlikely that "^(a) will be large" 

by showing that 

"for all a and all slight random perturbations Aa, it is unlikely that 
^(a + Aa) will be large." 

A survey of this approach, called smoothed analysis, can be found in [30, 33]. If 
^(c, (t) denotes a probability distribution centered at c G W'^^ with covariance 
matrix cr^idp+i, and E denotes mathematical expectation, we may summarize the 
objects of study of worst-case, average-case, and smoothed analyses, for a function 
ij; : MP+i M, in the following table. 



worst-case analysis 


average-case analysis 


smoothed analysis 


sup ip{a) 


aG^(0,<T) 


sup E ip{z) 



A remarkable feature of [12] is that the average-case analysis performed there 
for a variety of problems is not done with ad-hoc arguments adapted to the problem 
considered. Instead, these applications are all derived from a single result bounding 
the tail of the distribution of ^(a) in terms of geometric invariants (degree and 
dimension) of the set of ill-posed inputs of the problem for which is a condition 
number. 

A first goal of this paper is to extend the results of [12] from average-case to 
smoothed analysis. We will, however, also prove average-case bounds. Demmel's 
paper dealt with both complex and real problems. For complex problems he provided 
complete proofs. For real problems, Demmel's bounds rely on an unpublished (and 
apparently unavailable) result by A. Ocneanu on the volumes of tubes around real 
algebraic varieties. A second goal of this paper is to prove a result akin to Ocneanu's 
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(Theorem 1.2). We are not the first doing so. In [44], R. Wongkew gave a bound for 
the volume of tubes around real algebraic varieties. A number of constants in his 
bounds, however, are not explicit and only shown to be independent of the variety. 



1.1 Statement of the Main Result 

We assume our data space is RP"*"^, endowed with a scalar product ( , )• By a 
semi-algebraic cone S C we understand a semi-algebraic set S 7^ {0} that is 

closed by multiplications with positive scalars. We say that is a conic condition 
number if there exists a semi-algebraic cone E C the set of ill-posed inputs, 

such that, for ah data a G 1^^*+^ \ {0}, 



dist(a,S)' 

where || || and dist are the norm and distance induced by ( , ). 

The best known condition number is that for matrix inversion and linear equa- 
tion solving. For a square matrix A it takes the form k{A) = ||A||||^~^|| and was 
independently introduced by H. Goldstine and J. von Neumann [40] and A. Tur- 
ing [39]. Strictly speaking, k{A) is not conic since the operator norm || || is not 
induced by a scalar product. Replacing this norm by the Frobenius norm || \\f 
yields the (commonly considered) version kf{A) := of k{A). The Con- 

dition Number Theorem of C. Eckart and G. Young [14] then states that kf{A) 
is conic, with S the set of singular matrices. Other examples can be found in [9], 
where a certain property (related with the so called level-2 condition numbers) is 
proved for conic condition numbers. Furthermore, it is argued by Demmel in [11] 
that the condition numbers for many problems can be bounded by conic ones. 

Note that, since E is a cone, for all A > 0, ^(a) = ^(Aa). Hence, we may restrict 
to data a lying in the sphere := {x £ W+'^ \ \\x\\ = 1}. If we set := E U (-E), 
then the conic condition number can be estimated as 

^^"^ - dist(a,E,) " dp(a,E, n^P)' 

where dp denotes the projective distance in S^, which is defined as dp{x,y) = 
smdji{x,y) with dji being the Riemannian (or angular) distance in (cf. Fig- 
ure 1). 



3 




dnia, n SP) 



Fig. 1 Three distances 

Let Bp(a,a) denote the open ball of radius arcsino" around a in (a correspond- 
ing to projective distance). We will endow Bp{a,a) with the uniform probability 
measure. Moreover, let 



Op := vo\p{SP) 



r(^) 



denote the p-dimensional volume of the sphere S^. Our main result is the following. 

Theorem 1.1 Let ^ he a conic condition number with set of ill-poscd inputs S. 
Assume that S n S"^ C W where W O is the zero set in of homogeneous 
polynomials of degree at most d > 1 and W ^ S^. Then, for all a & S^, all 
ae (0,1], and all t> 1, 



p-i 



Prob {-Tfz) > n < 4 V r ) {2d)'' ( 1 + - 
and, for all a G (0, 1] , 



+ 



^ {2df 



ta J 



1 



sup E (In^(z)) < 21np + 21nd + 21n- + 5.5. 

aeSP zeBf{a,a) C 



In particular, for all t > 1 (take a = 1) 

p-i 

A'rf(z) >t} <4 

zesp 



Prob{*,.) > < (^) i2,f (l + J)'"' ^ + (2,r \ 



k=l 



and 



E (ln<r(z)) < 21np + 21nd + 5.5. 

zeSp 
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The main idea towards the proof of Theorem 1.1 is to reformulate the probabihty 
distribution of a conic condition number as a geometric problem in a sphere. Indeed, 
for a measurable subset V of we denote by vo\p{V) the p-dimensional volume of V. 
If —V = V we define the e -neighborhood around V in S'^ by 

TwiV,£) := {xeSP\ dwix,V) < £}. 

With this notation, using n 5^ C W, we obtain from (1) for a e and a G (0, 1] 

Prob I ^(z) > - 1 < Prob {dp{z, n SP) < e} < Prob {dp(z, W) < e} 

voip(rF(w,£)n^p(a,(7)) 

volp(Bp(a,(T)) 

The tail bounds in Theorem 1.1 will thus follow from the following purely geometric 
statement. 

Theorem 1.2 Let W C be a real algebraic variety defined by homogeneous 
polynomials of degree at most d > 1 such that W ^ S^. Then we have for a e 
and < e,a < 1 



jU — 1 N / ir 



volpBp(a,cr) \kj V a J \crJ Cp-i ^cr7 

In particular, (take a = 1) 



O. 



p 



Here is a brief outline of the proof of Theorem 1.2: The first step is an upper 
bound on the volume of an e-neighborhood of a smooth hypersurface in terms of 
integrals of absolute curvature (Proposition 3.1). This is a variation of H. Wcyl's [41] 
exact formula for the volume of tubes, a formula which, however, only holds for 
sufficiently small e. Then (Proposition 3.2) we derive a degree bound on these 
integrals of absolute curvature based on the kinematic formula of integral geometry 
and Bezout's theorem. Finally we get rid of the smoothness assumption by some 
perturbation argument. 

We will devote Section 4 to derive applications of Theorem 1.1 to several con- 
dition numbers occuring in the literature, namely, those for linear equation solving, 
eigenvalue computation, polynomial system zero finding, and zero counting. 

Remark 1.3 Theorem 1.2 could be stated for real projective space with the 

same bounds. While such a statement is the most natural over the complex numbers 
(cf. [1] and [6, Theorem 1.3]) it does not follow the tradition over the reals (cf. [12, 
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41, 44]) and it is not a natural ambient space for real conic condition numbers. Note 
that S is not necessarily symmetrical around the coordinate origin and that the use 
of a symmetric W (an algebraic cone containing the semi-algebraic cone S) is just 
an artifice of our proofs. 

1.2 Relation to previous work 

In most instances of smoothed analysis (e.g. [10, 13, 30, 31, 32, 45]) one studies the 
behaviour, for a function : MF'^^ — R, of 

sup E tp{z) (2) 

agRp+l zeATP+l (0,0-2) 

(possibly for sufficiently small a) where NP~^^{a,a^) denotes the p+ 1-dimensional 
Gaussian distribution over R with mean a and variance cj^. It has been argued that 
smoothed analysis interpolates between worst and average cases since it amounts to 
the first for cr = and it approaches the second for large a. 

When ip{Xx) = ijj{x) for all A > — e.g., a conic condition number — it makes 
sense to restrict ip to the sphere S^. In this case, it also makes sense to replace 
the distribution NP~^^[a,a'^) by the uniform distribution supported on the disk 
Bf{a,a) C SP and to consider, instead of (2), the following quantity 

sup E tp{z). (3) 

aeSP zeBp{a,a) 

Note that in this case, the interpolation mentioned above is transparent. When 
(7 = the expected value amounts to tpia) and we obtain worst-case analysis, while 
if (7 = 1 the expected value is independent of a and we obtain average-case analysis. 

It is this version of smoothed analysis, introduced in [6], which we deal with in 
this paper. Note that while, technically, this "uniform smoothed analysis" differs 
from the Gaussian one considered so far, both share the viewpoint described above. 

We have already mentioned the references [10, 13, 30, 31, 32, 45] as instances of 
previous work in smoothed analysis. In all these cases, an ad hoc argument is used 
to obtain the desired bounds. This is in contrast with the goal of this paper which 
is to provide general estimates which can be applied to a large class of condition 
numbers. We believe the applications in Section 4 give substance to this goal. 

We finish this section with a brief overview on previous work on the relations be- 
tween complexity, conditioning and probabilistic analysis. In [3], L. Blum suggested 
a complexity theory for numerical algorithms parameterized by a condition number 
"^(a) for the input data (in addition to input size). S. Smale [29, §1] extended this 
suggestion by proposing to obtain estimates on the probability distribution of "^(a). 
Gombining both ideas, he argued, one can give probabilistic bounds on the com- 
plexity of numerical algorithms. 

The idea of reformulating probability distributions as quotients of volumes in 
projective spaces (or spheres) to estimate condition measures goes back at least to 
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Smale [28] and Renegar [20]. In particular, J. Renegar [20] uses this idea to show 
bounds on the probabiUty distribution of a certain random variable in the average- 
case analysis of the complexity of Newton's method. Central to his argument is the 
fact that this random variable can be bounded by a conic condition number. The 
set of ill-posed inputs in [20] is a hypersurface. An extension of these results to the 
case of codimension greater than one was done by Demmel [12] where, in addition, 
an average-case analysis of several conic condition numbers is performed. Most of 
these results are for problems over the complex numbers. An extension in another 
direction, namely, to possibly singular ambient spaces, was done by C. Beltran and 
L.M. Pardo [1]. Another extension of Demmel's result, now to smoothed analysis 
for complex problems, was achieved in [6]. 

The remaining of the paper is organized as follows. In Section 2 we provide the 
preliminary notations and results needed to prove Theorem 1.2. These are mostly 
taken from differential and integral geometry. In Section 3 we prove Theorem 1.2 
and Theorem 1.1. Section 4 is devoted to several applications of our main result. 
Finally, the precise value of some constants — whose existence is well-documented 
in the literature but whose magnitude isn't — is derived in an appendix. 

2 Preliminaries 

2.1 Distances, volumes, and tubes on the sphere 

The p-dimensional sphere carries the structure of a compact Riemannian man- 
ifold. Correspondingly, there is a Riemannian distance dR{x,y) € [0,7r] between 
two points x,y e S^, which is just the angle between these points. We denote by 
Bji{a,a) = {x e SP \ dii{a,x) < a} the open ball of radius a centered at a G S^. 

It will be more natural for us to work with the related notion of projective 
distance, which is defined as dp{x,y) := smdji{x,y) G [0, 1] (cf. Figure 1). We note 
that dp satisfies all the axioms of a metric, except that (ip(x, y) = iff a; G {—y, y}- 
In fact, dp induces a metric on the real projective space P*' (obtained from by 
identifying antipodal points). However, we prefer to work on the sphere, which 
seems more intuitive to us. 

Let F be a symmetric subset of S^, that is —V = V. For < e < 1 we define 
the e -neighborhood around V by 

Tp{V,£) := {xeSP\ dp{x, V) < e}, 

where df>{x,V) := iniy^v dp{x,y). This equals the a-ncighborhood of V defined 
with respect to dji, where a = arcsine. However note that Tp({a},e) = Bji(a,a) U 
a, a). Somewhat inconsistently, we write Bp{a,£) := Bji{a,a) and call this 
the open ball of radius e with respect to the projective metric. 

For a measurable subset A C 5*^ we write volp^ = J"^ dS'^ for the p- dimensional 
volume, where dS^ denotes the volume form induced by the Riemannian metric. 
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In order to compute volumes of balls and tubes around subspheres, the following 
functions Jp^k{oi) are relevant: 

Jp,ik(a) := r{smpf-^ {cos pf-'^ dp {1 < k < p). 
Jo 

Lemma 2.1 For l<k<p,0<a< Tr/2, and e = sin a, we have 

volpTp(S'P-^,£) = Op_kOk-iJp,k{a), yo\pBR{a,a) = Op-iJp,p{a). 

Proof. This follows from [41] or by straightforward calculation. □ 

The quantity Jp,k{ct) can be easily bounded. Recall that Op denotes the p- 
dimensional volume of S^. 

Lemma 2.2 The following estimates hold (l<k<p, 0<a< Tr/2, e = sinaj; 

£P O 
Jp,k{oi) < ifk<p, —< Jp,p{a) < eP 

with equality when a = 7r/2 in the last upper bound. 
Proof. To settle the first inequality note that for A; < p 

ra /■e ^k 

Jp,k{^) - / (sinp)^"^ (cosp) dp = / u^~^du = —. 
Jo Jo ^ 

Similarly, 

ra pa ^p 

Jp^p{a) = / {sinp)P^^dp> / (sinp)^~^ cospdp = — . 

Jo Jo P 

It is easy to check that a — > Jp^p{a) {sin a)~P is monotonically increasing on [0,7r/2] 
by computing the derivative of this function. Hence, Jp{a){sina)~P < Jp^p(7r/2). 
^Prom Lemma 2.1 we get ^Op = volpSfl(a, 7r/2) = Op-\Jp^p{'K /2) from which it 
follows that Jp,p(7r/2) = 20~i- '-' 

In this paper, the notions of manifold and differentiability always refer to C°°- 
differentiability. For a submanifold M of and 0<a<7r/2we define the a-tube 
T^{M,a) around M by (compare with [16, p. 34]) 

T^{M, a) := {x E \ there is a great circle segment in of length < a 

from a; to M that intersects M orthogonally}. 

Here we used the Riemannian distance. Sometimes, when thinking in terms of the 

projective distance and M = —M, it will be convient to use the notation of e-tube 
r/(M,£) := r^(M,arcsin£) defined for < £ < 1. Clearly, T^{M,£) can be 
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characterized in a way similar to T^{M, a). We note that Tp (M, s) C Tjp{M, e) and 
the inclusion is in general strict. It can be shown, however, that if M is a compact 
submanifold, then equality holds. 

We also need the notion of the m-dimensional volume (Hausdorff measure) of 
subsets T of SP. For simplicity, we restrict ourselves to semialgebraic sets, cf. [5]. 
Let T be an m-dimensional semialgebraic subset of S^. The Zariski closure W oi T 
in is a real algebraic variety of dimension m and its regular locus Reg(M^) is an 
m-dimensional submanifold of S^. We define the m-dimensional volume of T by 
vol^T := volj„(r n Reg(T^)). This makes sense since T \ Reg(T^) has dimension 
strictly less than m. For k < m we set volfeT := 0. 

2.2 A useful transformation formula 

We will repeatedly use the following special case of the coarea formula. A proof can 
be found in [37, III§2, Folgerung 1] or [17, Appendix]. 

Proposition 2.3 Let M, N he Riemannian manifolds of the same dimension and 
cp: M ^ N be differentiable. Suppose that Jj^ \ det Dip\ dM is finite. Then the fiber 
(p~^{y) is finite for almost all y e N and we have 



2.3 Some differential geometry of hypersurfaces on spheres 

For the following material from differential geometry wc refer to [35, 38]. 

In the following let M be a compact oriented smooth hypersurface of inter- 
preted as a Riemannian submanifold. The orientation corresponds to the choice of a 
unit normal vector field u: M ^ W^^ on M. The Weingarten map Lm{x) : T^M 
TxM of M at x is the linear map defined by Lm{x) := —Du{x) (it is easy to verify 
that this is a well-defined map). The second fundamental form, of M at a: € M is the 
corresponding bilinear map ^m{x): T^M x TxM —>■ R, defined by ^m{x){Y, Z) = 
{Lm{x)Y, Z) for all Y,Z ^ T^M. We are going to describe these notions in terms of 
local coordinates of M. Thus let v = . . . , Vp-i) ^ x{v\, ... , Vp-i) G M C 1^^+-^ 
be a local parametrization of M. Then 



where the last equality follows from deriving {dx / dvj,v) = 0. In particular, JlMix) 
and Lm{x) are symmetric. The eigenvalues ki(x), . . . , Kp-\{x) of Lm{x) are called 
principal curvatures at x of the hypersurface M. 

Example 2.4 Consider the case of M = SP~^, the subsphere of given by the 
equation Xp = 0. Then it easy to see that 11^(3;) = for all x G M. Hence all 
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the principal curvatures of M are zero. This example makes clear that the principal 

curvatures are relative to the ambient space S^. (Of course, SP~^ is curved; however, 
its "curvature relative to the ambient sphere" is zero.) 

For 1 < i < pwe define the ith curvature KM,i{x) of M at x as the ith elementary 
symmetric polynomial in ki(x), . . . ,Kp_i(x), and put KMfi{x) := 1. In particular, 
Km,p-i{x) = det Lm{x). Note that the ith curvatures are essentially the coefficients 
of the characteristic polynomial of the Weingarten map: 

p-i 

det(idp_i + tLuix)) = Y^t^ KM,i{x). (4) 

i=0 

Definition 2.5 Let M be a compact oriented smooth hypersurfacc M of 5"^ and U 
be an open subset of M. The integral Hi{U) of ith curvature and the integral 
of ith absolute curvature over U, with respect to the ambient space M, are defined 
as (0 < z < p - 1) 

Hi{U) := I KM,idM, and \ni\{U) := f \KM,i\dM. 
Ju Ju 

A few remarks: is monotone in the sense that |//j|(C/i) < \fJ-i\{U2) for Ui C 1/2- 
Also, note that \lJ^iiU)\ < \ni\iU) and ^{9) = |//i|(0) = 0. Moreover, noiU) = 
\fio{U)\ = volp_i(C/). By Example 2.4, \fii\{SP-^) = for i > 0. 

Example 2.6 Consider the boundary Ma of the ball Bji{q, a) in of radius < 
a < 7r/2 centered at q. Clearly, Mq, is an {p — l)-dimensional sphere of radius sin a 
that is described by the equations 

xq = cos a, Xi + --- + Xp = sin^ a 

if g = (1,0, . . . ,0). We orient by the unit normal vector field on pointing 
towards q. It is straight-forward to see that the second fundamental form of Ma 
satisfies JlM^ix) = (cot a) idp_i for all x G Mq. 
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Fig. 2 The manifold M, 



Hence all the principal curvatures of Mq, at x are equal to cot a. Therefore the 
ith curvature of satisfies KMa,i{x) = (^7^) (cot a)*, a quantity independent of 
X G Mq. For the integral of the ith curvature we obtain 

lii{M^) = KM^,i^fo\p.iMa =(^~ ^^Op_i(sinaf-*-i(cosa)\ (5) 

using the fact that volp_iMo, = Op_i(sina)P~^. In particular, note that jj,p-\{Ma) = 
Op-i{cosaY^^ . Finally note that iii{U) = \iii\{U) for all open subsets U of Mq, 
since all the principal curvatures are nop-negative. 



2.4 A kinematic formula from integral geometry for spheres 

We recall here a basic formula of integral geometry. The orthogonal group G = 
0{p + 1) operates on in the natural way. We will denote by dG the volume 
element on the compact Lie group G normalized such that the volume of G equals 
one. We will interpret as a submanifold of 5'^' for i < p, e.g., given by the 
equations Xj+i = ■ ■ ■ = Xp = 0. 

Let M be a compact oriented smooth hypersurface of S'^. It follows by standard 
methods from Sard's lemma [34] that gM intersects S"*"*"^ transversally for almost 
all g e G. Hence, for almost all g & G, the intersection gM n S^~^^ is either 
empty or a smooth hypersurface of S^'^^. Moreover, this intersection inherits an 
orientation from M in a natural way as follows: let u be the distinguished unit 
normal vector field of M. Then we require that the distinguished unit normal 
vector of the hypersurface gM n S^~^^ in 5*+^ at x lies in the positive halfspace of 
TxM determined by u. 

Therefore, for almost all g E G, the integral of the ith curvature fii{gM n S'^'^^) 
of gM n S^~^^ , considered as submanifold of S^~^^, is well defined, and this is also the 
case for fii{gU n 5*"*"^) when U denotes an open subset of M. 
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We will need the following special case of the principal kinematic formula of 
integral geometry for spheres. A proof can be found in [17]. For Euclidean space a 
corresponding result was stated by Chern [8]. 

Theorem 2.7 Let U be an open subset of a compact oriented smooth hypersur- 
face M of and < i < p — 1. Then we have 

HiiU) = C{p, i) [ ^li{gU n S'+') dG{g), 
JgeG 

where Cip,^) = ip-^- 1) (^T^) a^g^^^^. □ 

While the existence of the constants C{p,i) follows from [8, 17], it is quite cum- 
bersome to extract explicit formulas for C(p,i) from these sources. This is partly 
due to the fact that the quantities /Xj (and even Oi) have slightly different meanings 
in the literature. For the convenience of the reader, we have therefore included a 
short derivation of these constants in the appendix. 

For future reference, we state the case i = of Theorem 2.7 in slightly more 
general form. 

Corollary 2.8 For any semialgebraic subset T of such that dimT < p — 1 we 
have 

volp_ir=%i/ \TngS'\dG{g). (6) 
^ JgeG 

Proof. Using the comments given at the end of §2.1, it is easy to reduce to the 
case where T is an open subset of a hypersurface of S^. Now apply Theorem 2.7 
for i = 0, taking into account that neither the compactness nor the orientability 
assumption are necessary in that case, cf. [21, §15.2]. □ 

3 On the volume of tubes around real algebraic sets 

The goal of this section is to provide the proof of Theorem 1.2. 

3.1 Bounding the volume of tubes of smooth hypersurfaces 

In an important paper, Weyl [41] derived a formula for the volume of tubes around 
a compact submanifold of Euclidean space or a sphere. However, this formula only 
holds for a sufficiently small radius. The following proposition gives an upper bound 
on the volume of tubes around a hypersurface that holds for any radius. Compare 
also Gray [16, Theorem 8.4, (8.6), p. 162]. 
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Proposition 3.1 Let M be a compact oriented smooth hypersurface of and U 
be an open subset of M. Then we have for all < a < Tr/2 



p-i 



i=0 



Proof. Let u: AI ^ S'^ he the unit normal vector field on M corresponding to 
its orientation. For x G M consider the following parametrization 



x + ti'{x) x + ti'{x) 



\x + tiy{x)\\ (1 + ^2)5 



of the half great circle intersecting M at x orthogonally. Note that dji{x,ipx{t)) 
arctan t. 

t = tan(a) 

X + tu{x) 




Fig. 3 The point (fx{t) and the magnitudes t and a 
Consider the following differentiable map of Riemannian manifolds 

M xR^ S^\{x,t) ^ ipx{t). 

Let U be an open subset of M, < a < vr/2, and put r = tana. Wc denote by 
r+(C/,a) and T^{U,a) the images of C/ x (0, r) and U x (— r, 0) under the map ip, 
respectively. Clearly, T^{U, a) = U U T+{U, a) U T^{U, a). 

We apply the transformation formula of Proposition 2.3 to the surjective differ- 
entiable map X (0, r) T^{U,a) of Riemannian manifolds. This yields 



L 



1 det D(p\ d{M x 



IUx{0,t) JyeT+{U,a) 

By Fubini, /^^^^ | det Dip\d{M xR) = £ g{t) dt, where 



g{t):= \ det Dip\{x,t)dM{x). 
Jxeu 
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Claim A. The determinant of the derivative Dip{x, t) of at (x, t) G M x M satisfies 
\detD^ix,t)\ = ^^^^2)(p+i)/2 \det{idT^M-tLM{x))\. (7) 
Using this claim, whose proof is postponed to the end, we obtain 

g{t) = / ^-^\det{idT,M-tLMix))\dM{x) (by Claim A) 

Jxeu (1 +t2) — 

^ E— / \KM,i\dM (by (4)) 

By making the substitution t = tan p (recall r = tan a) we get 

Jo (l + t2)(p+i)/2 ^^^yo (cosp)*'~*~^(sinp)Mp = Jp,i+i(a). 
Altogether we conclude that 

volpr+(f/,a)< / 5(t)dt < VJp,^+l(a)lwl(^^)• 
•^o i=0 

The same estimate can be shown for volpT^ [U, a), which implies the desired estimate 
of volpT^ (;/,«). 

It remains to prove Claim A. Choose a local parametrization x = x{v) e MP'^^ 
of M in terms of coordinates vi, . . . , fp-i defined in a neighborhood of 0. We assume 
that du^x^ • • • 5 ^Vn- 

_-^x are orthonormal at 0. Abusing notation we will interpret 
V = ^{v) as a function of v. The matrix (Ajj) of Lm with respect to the basis d^.x 
of rj;M is given by -d^^v = J2j KjdvjX. 

Consider the map {v,t) hh- R{v,t) := x{v) + ti^{v) G W+^. Then 

{v, t) ^ ^Piv, t) := ipix{v),t) - 



(l+t2)2 



provides a local parametrization of S'^. In the following let [R, dtR, d^^R, ■ ■ ■ , 5^p_i-R] 
denote the square matrix of size p+1 whose rows are R and the partial derivatives 
of R. Using the multilinearity of the determinant and the fact 

dfip = (1 + t^Y^'^dtR - t{i + 12)-3/2^ 
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we obtain by a short calculation that 

\det Dip{v,t)\ = \det[tp,dti^,dy^i;,...,dy^_^^]\ 

= (l+^2)(p+l)/2 I [-^' ^vrR,---, R] I ■ 

Computing partial derivatives we get ^v^R = d^.x + td^.v and dtR = u. Using 
dyM = — XijdyjX, we get from this 

9viR = J2j{^ij - t\ij)dy.x. 

Hence we obtain 

det[R,dtR,dvj^R,...,dy^_-^R] = det[x + tiy,u,dyj^R, . . . ,dv^_^R] 

= det [x, u, dyj^R,... , dv^_^ R] 
= det{6ij - tXij) det[x, u, d^^x, ... , dy^_-^^x]. 

Since we assume that dy-^x, . . . , dy^_-^x are orthonormal at = 0, we conclude (using 
the chain rule Di/j = DipDx) that 

\det D<p{x,t)\ = \detDi;{0,t)\ = det(% - Uy)!, 

which shows Claim A. □ 



3.2 Bounding integrals of absolute curvature in terms of degree 

In this section let / G IR[^05 • • • > ^p] be homogeneous of degree d > with nonempty 
zero set V C. such that the derivative of the restriction of / to does not vanish 
on V. Then F is a compact smooth hypersurface of S^. We orient V by the following 
unit normal vector field (Gauss map) 

v. V ^ S^, i/(x) = ||grad/(x)||~^ grad/(x). 

The goal of this section is to bound the integrals of absolute curvature on patches 
of V. 

Proposition 3.2 For a G S^, < a < 1, and < i < p we have 

\fii\{VnB^{a,a)) < 2(^~^^Op.id'+'aP-'-\ 

The proof is based on the following lemma. 
Lemma 3.3 We have |//p_i|(F) < Op-idP. 
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Proof. Recall that the determinant of the linear map Lv{x) = —Dv{x) : T^V 
TxV equals Ky^p-i (cf. §2.3). We may assume without loss of generality that the 
open subset U := {x ^ V \ rank(Z)i^(,T)) = p — 1} of y is nonempty (otherwise 
lip-i(y) = 0). We would like to apply Proposition 2.3 to the restriction of v to U, 
but face the problem that it is only an immersion so that ^{U) might not be a 
submanifold of S'P. In order to circumvent this, we use some standard facts of real 
algebraic geometry [5]. 

Consider the Zariski closure W of i'{U) in S^, which is a real algebraic variety 
of dimension p — 1. Its regular locus Wi is a submanifold of of dimension p—l. 
Consider the open subset Vi := U Cl h'~^{Wi) of V and the restriction z^i : Vi — > Wi 
of I'. The singular locus Sing(VF) = W \ Wi is an algebraic subset of dimension 
strictly less than dimVF. Since vi is an immersion, we conclude that U \ Vi = 
z/f ^(Sing(M^)) has dimension strictly less than p — 1. We therefore obtain 

\iip-i{v)\ = \iip-i{u)\ = \iip-i{yi)\. 

Applying the transformation formula of Proposition 2.3 to vi we get 

\l^p-i{V)\= [ \detDui\dV=[ \u^\y)\dWi{y). 
JVi JyeWi 

Consider for ^ G N U {oo} the semialgebraic sets := {y G Wi \ = £}■ 

Since the above integral is finite, volp_iFoo = 0, and therefore dimFoo < p — 1. We 
obtain 

[ \i^^\y)\dWi{y) = ^£volp_iF^ 
JyeWi 

Corollary 2.8 applied to F£ yields the following 

volp_iF, = %i / \FengS'\dGig). 
^ JgeG 

Combining these findings we get 
K_i|(F) = %i r J2^\FengS'\dG{g) = ^ [ \i^^\gS')\dG{g). 

^ JgeG ^ JgeG 

(In order to see the last equality use that gS^ does not intersect F^c almost surely.) 

It is now sufficient to prove that \i'~^{gS^)\ < 2# for almost all g G G. To sim- 
plify notation suppose first that g = id. Let yg, . . . ,yp denote coordinate functions 
on RP+i and = {y e SP \ y2 = ■■■ = yp = 0}. A point x G W+'^ lies in 1^-^(5^) 
iff it satisfies the following system of equations 

Y^xj-1 = 0, fix) = 0, d2fix) = ■■■ = dpf{x) = 0. 
i 

If all real solutions of this system of equations are nondegenerated, then they are 
isolated in C^+-'^. By Bezout's theorem [22] the number of these solutions is bounded 

by 2d{d — 1)^"^ < 2dP. However, one can show along the lines in [19], that the 
nondegeneracy condition is satisfied for almost all g & G. This finishes the proof. □ 
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Proof of Proposition 3.2. Put U := FnSp(a, a). The case i = p-lis already 
settled by Lemma 3.3, as \fii\{U) < \fii\{V). So we may assume i < p — 1. 

Let U-^- be the set of points of U where Kv,i is positive and similarly define U- 
where Kv,i is negative. Then |//j|([7) = \ni{U^)\ + |//i(C/_)|. 

Let g e G = 0{p+l) such that V intersects gS'''^^ transversally and such that the 
intersection is nonempty. Then VDgS^'^^ is the zero set in gS'^'^^ of the homogeneous 
polynomial / of degree d. By transversality, the derivative of the restriction of / to 
gS^~^^ does not vanish on F n gS^~^^. Hence we may apply Lemma 3.3, which gives 
the estimate 

\l,i\{VngS'+^)<Oid'+\ 

By the monotonicity of (now refering to the hypersurface V fl gS^'^^ of gS^'^^) 
we have 

\fii{U+ngS'+')\ < \i^i\iU+ngS'+') < \i^i\iVngS'+^) < Oid^+\ 
The Kinematic Formula of Theorem 2.7 implies that 

\^Jii{U+)\<C{p,^) [ \fii{gU+nS'+^)\dG{g). 

JgeG 

Therefore, we obtain 

\liiiU+)\ < C{p,i)OiS+^ Prob{5f/+n 5^+1/0}, 

where the probability is taken with respect to the uniform distribution in G. We 
may estimate this probability as follows (put a = arcsin u) 

Prob{Sp(5a, (t) n 7^ 0} = Prob{Sp(a', a) n S'^^ ^ 0} 

gSG a'&SP 

- — o; — •^p.p— — — 

where we used Lemmas 2.1 and 2.2 for the last two steps. Multiplying this with the 
formula for C{p,i), the expression miraculously simplifies and we get 

The same estimate can be shown for |//j|(J7_), which proves the proposition. □ 

3.3 Extension to the non-smooth case: proof of Theorem 1.2 

The following proposition estimates the volume of the tube around a patch of a 
smooth hypersurface in the sphere. 
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Proposition 3.4 Let f G M[Xo, . . . , Xp] be homogeneous of even degree d > Q with 
zero set V = 2{f) in S^. Assume that the derivative of the restriction of f to 
does not vanish on V. (Thus V is a smooth hypersurface in S^.) Let a E and 
0<e,a <1. Then 

P k=i ^''^ 

Proof. Put U ■—Vn Bp{a, a). Take < a < 7r/2 such that e = sin a. Proposi- 
tion 3.1 imphes 

p-i 

yo\pT^{U,s) = yo\pT^{U,a) < 2^ Jp,i+i(a) \fii\iU). 

1=0 

Combining this with Proposition 3.2 we obtain 

yo\pT^{U,e)< 4^ ( ^ . ] Op^id'+' a^-'-' Jp,i+i{a) . 
i=o ^ ^ ^ 

Using the estimates of Lemma 2.2 we obtain (put k = i + 1 and consider separately 
the term for k = p) 

k=l P 

Now use (^Ii) = |(^) to get the desired upper bound on vo\pT^{U,e). □ 



Proof of Theorem 1.2. We have to remove the smoothness assumption in 
Proposition 3.4 and to estimate the volume of the £-neighborhood instead of the 
£-tube. 

Assume W = Z(fi, . . . , fr) with homogeneous polynomials fi of degree di. Then 
W is the zero set in SP of the polynomial 

r 

f{X) :=^/,(X)2||Xf<^-2*, 

i=l 

which is homogeneous of degree 2d. Our assumption W ^ SP implies dim W < p. 

Let (5 > be smaller than any positive critical value of the restriction / : 5^ ^ M 
of f to SP. Then Dg := ^ SP \ f{$,) < 5} is a compact domain in SP with smooth 
boundary 

dDs = UeSP\ fiO = S}. 
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Indeed, the derivative of / — 5 does not vanish on dDs (use J2i Xidif{x) = 2df{x)). 
Moreover, note that W = r]s>oDs and hence lim.s_,Q\/o\pDs = volp(VF) = 0, since 
dim W < p. 

Claim B. We have Tp{W, e)CDsU Tp{dDs, e) for < e < 1. 

In order to see this, let x G Tp(W,£) \ Ds and 7: [0, 1] be a segment of 

Riemannian length less than arcsine such that 7(1) = x and 7(0) G W. Consider 
F: [0,1] R,F{t) := f{-f{t)). By assumption F{1) = f{x) > 5 and F(0) = 0. 
Hence there exists r G (0, 1) such that F{t) = 6. Thus j{t) G dDg and dDs) < 
dp(x,7(r)) < £, which shows the claim. 

Using the triangle inequality for the projective distance, it is easy to see that 
(cf. Figure 4) 

Tp{dD5,e)nBw{a,a) CT^{dDsnBp{a,a + e),s). (8) 



dDs 




Fig. 4 The shaded area is Tp{dDs, e) n Bp{a, a) 

Combining (8) with Claim B we arrive at 

Tp{W, e) n Bp{a, a)CDsU T^{dDs n Bp{a, a + e), e). 

We apply Proposition 3.4 to 1^ = dDs = Z{f — 5\\x\\'^'^) intersected with the ball 
Bf>{a, a + e). This implies 

p—i 

yo\pT^idDsnB^{a,a+e),s) K^^PzlJ^ff) {2df {a+eY-^+20p{2dY {a+eY . 

P k=i 
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Taking into account that yo\pBr(a, a) > Op-i^ (cf. Lemmas 2.2 and 2.1) we obtain 

yo\p{Tp{W,e)nBp{a,a)) ^ volpDs ^ yo\pT^{dDs n Bp{a,a + e),e) 



vo\pBp{a, a) 



volpSp(a, a) 



volpSp(a, a) 



Taking the hmit for J the first term vanishes and the assertion follows. □ 



3.4 Estimating expected values: proof of Theorem 1.1 

The tail bounds in Theorem 1.1 follow from Theorem 1.2 as indicated in the intro- 
duction. It thus suffices to show how to derive the claim on expected values from 
the tail bounds. This is achieved by the following proposition. 



We 



use the inequality ^ — ^ ^ ^, valid for p > 2, which implies < P + 2. 

Proposition 3.5 For < a < 1 let > 1 be a random variable satisfying, for all 
<e<l andp>2, 

Prob{X, > 1/e} < (^) {2d)' (l + (-)' + 2(p + 2) (2d)^' (-)' . 

fc=i ^ ^ ^ ^ 

Then, for e < (i_|_2j)(p_i) ? -have Prob{Xo- > 1/e} < {8e + 4)dp^ and 
F,{lnX^) < 21np + 21nd + 21n- + 5.5. 



Proof. ProbjXo- > 1/e} is bounded by 

p 



a 



< 



fe=i ^ ^ 

k=l ^ ^ 
I — n \ / 



= 8d 



a 

8dps 



Using that e < 



(l+2d)(p-l) 



this can be further bounded to obtain 



Prob{X^ > 1/e} < 



8dpe 



a 



1 + 



1 y-^ 1 

+ 2 



p — 1 



8dp, r 
a I 
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For the bound on the expectation we use Proposition 2.4 in [6] which imphes 
E(1„X,) < ln( <^ + ^''^'^-^ )+ln(gjg(.+ l))+l 

< 21np + 21nd + 21ni + ln(3 -8(6 + ^)6) since 1 + 2d < 3d 

a 2 

< 21np + 21nd + 21n- + 5.5, 

a 

which completes the proof. □ 



4 Applications 

We give several applications of Theorem 1.1 to smooth analysis estimates for the 
condition numbers of the following problems: linear equation solving, eigenvalue 
computations, real polynomial equation solving, and zero counting. 

4.1 Linecir equation solving 

The first natural application of our result is for the classical condition number k,{A). 
In [45], M. Wschebor showed (solving a conjecture posed in [30]) that, for all n x n 
real matrices A with \\A\\ < 1, all < cr < 1 and alH > 

Kr> 

Prob {k{Z) >t)< — 

with K a universal constant. Hereby, \\A\\ stands for the operator norm with respect 
to euclidean norm. Note that, by Proposition 2.4 in [6], this implies 

sup E (InK(Z)) < Inra + ln- +lnE:+ 1. 

||A||<1 ZGN'^'^{A,a'^) 

We next compare Wschebor's result with what can be obtained from Theorem 1.1. 
Let II^IIf denote the Probenius norm of a matrix A G M*^^", which is induced by 
the inner product (^, i— >^ trace(AB-^). We have 

k{A) = \\A\\\\A-^\\ < \\A\\f\\A-^\\ =: KpiA). 

The Condition Number Theorem of Eckart and Young [14] states that H^^"*^!! = 
dF{A,T,)~^ where S C M"^" denotes the set of singular matrices and dp is the 
distance induced by the Probenius norm (see also [4, Thm. 1, Ch. 11]). It follows 
that Kp is a conic condition number. We can thus give upper bounds for Kpi-A) and 
they will hold as well for k{A). 
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Corollary 4.1 For all n > 1, < a < 1, we have 

sup E (ln«;i;'(Z)) < 61nra + 21n- + 5.5, 

where the expectation is over all Z uniformly distributed in the disk of radius a 
centered at A in the sphere S"' ~^ endowed with the projective distance. 

Proof. The variety S of singular matrices is the zero set of the determinant, 
which is a homogeneous polynomial of degree n. We now apply Theorem 1.1 with 
p = -n? — 1. □ 

Note, the bound in Corollary 4.1 is of the same order of magnitude than Wsche- 

bor's, worse by just a constant factor. On the other hand, its derivation from 
Theorem 1.1 is rather immediate. We next extend this bound to rectangular matri- 
ces. 



4.2 Moore-Penrose inversion 

Let £ > m and consider the space R^^"* of ^ x m rectangular matrices. Denote by 
E C M^^"^ the subset of rank-deficient matrices. For ^ S let denote its Moore- 
Penrose inverse (see, e.g., [2, 7]). The condition number of A (for the computation 
of A^) is defined as 

A,,, ||(A + AA)t-At||.p|| 
condUA) = hm sup — 1, i, ^ , 

This is not a conic condition number but it happens to be close to one. One defines 
K^A) = Pllptll and, since = dp(A,S)-i [15], we obtain 



dF{A,J:) 
In addition (see [36, §111.3]), 

kT(A) < condt(A) < ii-^K^(^). 

Thus, ln(cond^(y4)) differs from lrL{K'^{A)) just by a small additive constant. As for 
square matrices, k\A) is not conic since the operator norm is not induced by an 
inner product in M^^"*. But, again, we can bound k^{A) by the conic condition 
number — PIIfP^II > k^A). 

Corollary 4.2 For all I > m > 1 and < a < 1 we have 

sup E (lnKj,(Z)) < 21n^-|-41nm-|-21n- -1-5.5. 
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Proof. If a matrix A is rank deficient then det^o = where Aq is the m x m 
matrix obtained by removing all rows of A with index greater than m. Therefore 
S C So = {A G M^^™ I det^o = 0}. This implies that < ^^^j^^^^^ for 

ll^lli? = 1. Since Sq is the zero set of a homogeneous polynomial of degree m, an 
immediate application of Theorem 1.1 with W = Tiq yields the claimed bound. □ 



4.3 Real Eigenvalue computations 

Let A G and A G M be a simple real eigenvalue of A. Suppose that x G and 

y G are right and left eigenvectors associated to A, respectively (i.e., nonzero and 
satisfying Ax = Xx and y'^A = \y^). ^From the fact that A is a simple eigenvalue 
one can deduce that {x,y) ^ 0, cf. Wilkinson [43]. 

For any sufficiently small perturbation /S.A G M"^" there exists a unique real 
eigenvalue A + AA of ^ + close to A. We thus have 

{A + A^)(x + Ax) = (A + AA)(a; + Ax), 

which implies up to second order terms A^x+^ Ax ^ AAx+A Ax. By multiplying 
with from the left we get 

AA=^y^AAx + o(||AA||). 

Moreover, sup||/^^||^<i ly-^A^x] = ||x|| \\y\\. 

It therefore makes sense to define the condition number of A for the computation 
of A as follows 

and to set k{A, A) := oo if A is a multiple eigenvalue of A. 

If A has real eigenvalues, we could define the condition number of A for real 
eigenvalue computation as the maximum of k{A, A) over all the real eigenvalues A 
of ^. In order to arrive at a reasonable definition that makes sense also for matrices 
having no real eigenvalues, consider the subset E C M"^" of matrices having a real 
multiple eigenvalue as the set of ill-posed inputs. 

We define the condition number of A G M"^" for real eigenvalue computation by 

This definition is motivated by a result by Wilkinson [43], which (in slight reformu- 
lation) states that for all real eigenvalues A of ^ 

, , \/2|U||f . ^ 
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Thus 

'^eigenjR is a conic condition number; it varies continuously with A and takes 
the value oo when ^ G S. 

In [12], Demmel used the fact that the right-hand side of (11) is conic to obtain 
bounds on the tail of «;eigen,R(^) for random A. We next use this same fact to obtain 
smoothed analysis estimates. 

Proposition 4.3 For all n> 1 and < a < 1 we have 

sup E (lnKeigen,R(-^)) < 81nn + 21n- + 6. 

Proof. Let W be the set of matrices having multiple eigenvalues (real or complex). 

Clearly, E C W. In addition, W is the zero set of the discriminant polynomial, 
which is a homogeneous polynomial of degree — n in the entries of A (compare [6, 
Prop. 3.4]). Theorem 1.1 applied to the conic condition number jJ^^^][^^^ implies the 
stated bound. □ 



4.4 General Eigenvalue computations 

In addition to the computation of real eigenvalues, one can consider the compu- 
tation of general (possibly complex) eigenvalues for real (or complex) matrices A. 
Replacing the inner product in by a corresponding Hermitian product in C", the 
derivation of (9) still holds and one can now define 

/^eigen (^) := max k{A, A) 

where the maximum is over all eigenvalues A of ^. Replacing now S by the set W 
in the proof of Proposition 4.3, Wilkinson's result [43] still holds and yields 

- d\st{A,W) 

from which the following result easily follows (for part (ii) one considers complex 
numbers as pairs of real numbers, i.e., a complex matrix ^ as ^4 G M?"' ; this is 
actually the way programming languages deal with complex numbers). 

Proposition 4.4 For all n> 1 and < a < 1 we have 
(i) For real matrices A, 

sup E (lnKeigen(-^)) < 81nn + 21ni + 6. 

^g5n2-i ZeB{A,a) (T 
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(ii) For complex matrices A, 

sup E (lnKeigen(-^)) < 81nn + 21n- + (6 + 21n2). 

Proof. Part (i) is as in Proposition 4.4. For part (ii) one notes that W, as a 
subset of M^" , is the zero set of the real and imaginary parts of the discriminant 
polynomial, which have both degree — n. □ 



4.5 Solving real polynomial systems 

Let di, . . . , d„ G N \ {0}. We denote by Hd the vector space of polynomial systems 
/ = (/i, . . . , /„) with fi e M[Xo, . . . , Xn] homogeneous of degree dj. For f,g eUd 
we write 

a a 

where a = (ao, . . . ,q;„) is assumed to range over all multi-indices such that \a\ = 
ELo «fc = di and X" := X^°X^^ ■ ■ ■ X^". We endow the space Ha with the inner 
product {f,g) := J2'i=iifi^ 9i) , where 

Hereby, the multinomial coefficients are defined as 

\a) ao!ai! 

This inner product has the beautiful property of being invariant under the natural 
action of the orthogonal group 0{n + 1) on Hd- In the case of one variable, it was 
introduced by Weyl [42]. Its use in computational mathematics goes back at least 
to Kostlan [18]. 

Throughout this section, let ||/|| denote the corresponding norm of / G T^d- 
The Weyl inner product defines a Riemannian structure on the sphere S{TC(i) '■= 
{/ £ Hd I 11/11 = 1}. As in the previous sections, we endow this sphere with the 
corresponding projective distance dp. 

In a seminal scries of papers, M. Shub and S. Smale [23, 24, 25, 26, 27] studied 
the problem of, given / G TCd 'i^R C, to compute an approximation of a complex zero 
of /. They proposed an algorithm and studied its complexity in terms of, among 
other parameters, a condition number /Unorm(/) for /. In the following we will recall 
the definition of /inorm(/) adapted to the case of real systems and real zeros (see [4, 
Chapter 12] for details). 
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For a simple zero C G 5" of / G Hd one defines 

/^norm(/, C) := 11/11 || (I)/(C)|T,)"'diag( y^, . . . , 

where Df{C)\Tt^ denotes the restriction of the derivative of /: W^~^^ — > at ( to 
the tangent space T(;S"- = {v e M"+^ | {v,C) = 0} of S"" at C- (The norm on the 
right is the operator norm with respect to the Euclidean norm.) If ^ is not a simple 
root of / we set /i/norm(/) = oo. Note that /^norm(/, C) is homogeneous of degree 
in / and 

Shub and Smale [24] proved a condition number theorem for the condition num- 
ber /Ltnorm(/, C) complex polynomial systems / and complex roots (^. To describe 
a corresponding result in the real situation, consider for ( e S"' 

:= {g G TCd | ^ is a multiple zero of ^f}. 
Theorem 4.5 For a zero C & S"^ of f e S(Hd) we have 

Mnorm (/,C) 



dp(/,S^n5(7id)) 

Proof. The proof of [24] (see also [4, §12.4]) carries over immediately to the real 
situation. □ 

Let E C Hd be the set of systems / G Hd having a real multiple zero. Note that 
C S. Therefore, by Theorem 4.5, 

/Xnorm(/,C) < ^^(/^ ^ H 5(7^d)) ' 

We define the condition number for real polynomial solving as the right-hand side: 

1 



(/) ■■- 



dp(/,sns(7^d))' 



This is by definition a conic condition number: it varies continuously with / and 
takes the value oo when / G S. 

Proposition 4.6 For aJi di, . . . , G N \ {0} and all a G (0, 1] we have 

sup E (ln/XnormM(5)) < 21niV + 41nP-|-21nn + 21n- + 7, 

where N = dim Hd — 1 and V = di ■ ■ ■ dn is the Bezout number. 

Proof. Let W be the discriminant variety consisting of the systems / G Hd 
having a complex multiple zero. Then S C W. In addition, it is well known that 
W is the zero set of a multihomogeneous polynomial of total degree bounded by 
2n'D'^, where V = di - • • dn is the Bezout number (see, e.g., [6, Lemma 3.6]). The 
statement now follows immediately from Theorem 1.1. □ 
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Remark 4.7 (i) By the results in [23], the condition number /Xnorm(/) not only 
measures the maximum sensitivity of complex solutions to the input /, but 
it is also a crucial complexity parameter for algorithms approximating such 
solutions. While //norm,K (/) shares with //norm(/) the first property, it is not 
clear whether this is also the case for the second one. 

(ii) By considering complex numbers as pairs of real numbers one can see the prob- 
lem of, given / G Hd (or in Hd ^ C) , compute an approximation of a complex 
zero of / (studied by Shub and Smale [23, 24, 25, 26, 27]) as a problem over 
the reals. Proceeding as in §4.4, one can find bounds for the smoothed analysis 
of this problem similar to those in Proposition 4.6. 



4.6 Real zero counting 

Consider the problem of, given / G Hd, counting the number of real zeros of / in 5". 
Unlike the problems considered so far, this is a problem with a discrete output. This 
means that sensitivity considerations as described in the previous problems do not 
apply here. Yet, finite precision algorithms will require more precision to give a 
reliable output when the input / is close to the set S of systems with multiple real 
zeros, and will not give any such reliable output when / € S (since in this case, 
arbitrarilly small perturbations of / will change the output). It therefore makes 
sense to define as condition number for the counting problem 

cond(/) 



dist(/,S) 



Since cond(/) = /Unorm.R (/) for / G S(TCd), the bounds of Proposition 4.6 hold for 
cond(/) as well. We note, however, that in §4.5 we assumed a specific inner product 
on Hd (whose properties are crucial in the proof of Theorem 4.5). In contrast, for 
the smoothed analysis of cond(/), any inner product on Hd would do. 



Appendix 

The constants C{p,i) appearing in Theorem 2.7 depend only on p and i and are 
independent of the manifold M. We derive the expression for C{p,i) stated in 
Theorem 2.7 by selecting a simple enough M. Consider the boundary Ma = Ma{q) 
of the ball Bji(q, a) in of radius < a < 7r/2 centered at q (recall Example 2.6). 
According to Theorem 2.7 we have for < i < p — 1 

fiiiMa) = C{p, i)^ [ fiiiM^iz) n S'+')dSPiz). 

JSP 

Let p{z) denote the Riemannian distance from z G S"^ to S**"*"^. If p{z) < a, then 
the intersection Ma{z) fl 5'*"'"^ is the boundary of a ball in S"*"*"^. The radius 5{z) of 
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the sphere Mq,{z) fl S^'^^ satisfies cos a = cos p{z) ■ cos 5{z) by a well known formula 
of spherical trigonometry. From Example 2.6 we know that m{Maiz) n 5*+^) = 
Oi {cos 5{z)y. On the other hand, Ma{z) does not intersect S"*"*"^ if p{z) > a. 
^From these reasonings we obtain 

^Prom Lemma 2.1 and the definition of Jp^p-i-i{p) it follows that 

j-^^J0\pTR{S'+\p) = Oi+iOp_i_2(sin^)f-*-2 (cosp)^+^ 



We obtain 
1 

a 



I- j piiMo,{z)nS'+')dSP{z) = n cos ay {sin p)P-'-^ cos p dp. 

JSP Jo 



OiOi+iOp^i^2 , ,i (sin«)f-^-i 
-(cos a) 



Op ' ' p-i-l 
On the other hand, by Equation (5), 

Pi{Ma) = (^~^^Op.i{suiaf-'-\cosa)\ 

By comparing the last two equations, the asserted form of C(p, i) follows. □ 
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